Aligning chemical structure diagrams with local search
نویسندگان
چکیده
Chemists working in biomolecular application projects are usually looking at many related molecules (e.g. results of a virtual screening run, lead series development or library design). For a convenient visual analysis of this data it is essential that differences between molecules are easily detectable. This can be quite difficult if structural similarities are not taken into account while creating molecule structure diagrams. We present a method for generating globally aligned structure diagrams for two molecules following IUPAC standards [1]. Using a set of three coordinate transform operations (ring system flipping, chain flipping and substituent exchange) all correct and overlap-free layouts can be enumerated. If the number of possible layouts is too large, a heuristic is used to iterate through a smaller subspace. Subsequently all candidate layouts are scored with several different terms describing the quality of the layout (number of collisions, stretching of chains...) as well as the relationship between molecules (similarity to a template) and the one with the highest score is chosen. Scoring functions and similarity measures are easily interchangeable and the whole process is fast enough for interactive use. The whole alignment process is verified by calculating the RMSD between aligned and nearest template coordinates. For validation, the new method is applied to many clusters of related molecules from the PubChem compound library. In summary, we have developed a novel SDG algorithm which is of great help for the daily tasks of a modeller by drawing small, related molecules consistently.
منابع مشابه
Identification of BKCa channel openers by molecular field alignment and patent data-driven analysis
In this work, we present the first comprehensive molecular field analysis of patent structures on how the chemical structure of drugs impacts the biological binding. This task was formulated as searching for drug structures to reveal shared effects of substitutions across a common scaffold and the chemical features that may be responsible. We used the SureChEMBL patent database, which prov...
متن کاملOptimal Operation of a DWC by Self-Optimizing Control: Active Vapor Split Approach
Dividing Wall Column(DWC) offers the large potential for operating and capital cost saving in compared with conventional distillation sequence. In the studied DWC in this study, the aid of Vmin diagrams, it is shown that without a suitable value for vapor split fraction bellow the dividing wall in different operating conditions, the energy requirement increases from optimal value and it wil...
متن کاملAutomated extraction of chemical structure information from digital raster images
BACKGROUND To search for chemical structures in research articles, diagrams or text representing molecules need to be translated to a standard chemical file format compatible with cheminformatic search engines. Nevertheless, chemical information contained in research articles is often referenced as analog diagrams of chemical structures embedded in digital raster images. To automate analog-to-d...
متن کاملA Differential Evolution and Spatial Distribution based Local Search for Training Fuzzy Wavelet Neural Network
Abstract Many parameter-tuning algorithms have been proposed for training Fuzzy Wavelet Neural Networks (FWNNs). Absence of appropriate structure, convergence to local optima and low speed in learning algorithms are deficiencies of FWNNs in previous studies. In this paper, a Memetic Algorithm (MA) is introduced to train FWNN for addressing aforementioned learning lacks. Differential Evolution...
متن کاملDrawing Euler Diagrams from Region Connection Calculus Specifications with Local Search
This paper describes a local search based approach and a software tool to approximate the problem of drawing Euler diagrams. Specifications are written using RCC-8-constraints and radius constraints. Euler diagrams are described as set of circles.
متن کامل